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Priority-based task reassignments in hierarchical 2D 
mesh-connected systems using tableaux 



Dohan Kim 



Abstract 

D Task reassignments in 2D mesh-connected systems (2D-MSs) have been 

researched and simulated for several decades. We propose a hierarchical 2D 
mesh-connected system (2D-HMS) in order to exploit the regular nature of a 
2D-MS. In our approach priority-based task assignments and reassignments 
in a 2D-HMS are represented by tableaux and their algorithms. We provide 
examples of priority-based task reassignments in a 2D-HMS in which task 
relocations are simply reduced to a jeu de taquin slide. 
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^ 1. Introduction 
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A distributed system is a collection of processors connected by an ar- 
bitrary interconnection network [H [2]. Among various interconnection net- 
works for distributed systems, two-dimensional (2D) mesh has received ex- 
^ tensive study due to its simplicity, efficiency, and structural regularity [SHU]. 

^ Some data structures, such as matrices and arrays, naturally fit into a 2D 

^ mesh-connected system (2D-MS) [6J. Since tasks are often assigned to a 

j^ sub mesh in a 2D-MS, continuous submesh allocations and deallocations of 

different sizes may cause fragmentation |3l H] in a 2D-MS. Task relocation 
is an approach to decrease fragmentation by reassigning a running task to 
an idle processor in a 2D-MS, which involves capturing and transferring the 
state of the running task to the idle processor [3l [7j. Meanwhile, the main 
objectives of task assignments and reassignments in distributed systems are 
their applications to high performance computing (TJ [8] . In a similar vein our 
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objective of task reassignments in a 2D-MS is to minimize task turnaround 
time rather than reducing fragmentation. 

For decades, a wide variety of ways to tackle task assignment and re- 
assignment problems have been researched, such as graph-theoretic [9HT3], 
mathematical programming [T3], and heuristics fW\ [T7] . One of the com- 
mon methods to represent and solve a task assignment problem is a graph- 
theoretic method, which uses a graph-matching algorithm between a task 
graph and a processor graph [914TT]. To complement the graph-theoretic 
method, our previous work |T8] presented the Young tableaux [T9H22] ap- 
proach to representing task assignments. In this paper we convert a 2D-MS 
into a 2D mesh processor graph. Then, a 2D mesh processor graph is repre- 
sented by a Young diagram [19], El], and its task assignment is represented by 
a tableau. Our greedy task relocation policy is based on a hierarchical 2D- 
MS in which task relocations are performed systematically by using tableau 
algorithms. 

The remainder of this paper is organized as follows. Section |2]presents the 
problem statement along with its assumptions. We provide an introduction 
to tableaux and their algorithms in Section |3j In Section |4] we present how 
2D mesh graphs are converted into tableaux. Section [5] shows how task 
relocations under the greedy task relocation policy in a 2D-HMS are reduced 
to a jeu de taquin slide, and how they are applied in a 2D-HMS. Section [6] 
gives an application of our approach to priority-based task reassignments in 
a 2D-HMS. Finally, we conclude in Section [7) 

2. Priority-based task reassignment problem in a 2D-HMS 

We first give the necessary definitions of task assignments in a heteroge- 
neous system. Then, we present our priority-based task reassignment prob- 
lem in a hierarchical 2D mesh-connected system along with its assumptions. 

Definition 2.1 ([H]). A task graph T = {V,E) is a directed acyclic graph, 
where each vertex in V = {i;i,W2, . . . ,fn} represents a task, and each edge 
{vi, Vj) G E C V X V represents the precedence relationships between tasks, 
i.e., Vj cannot begin its execution before Vi completes its execution. 



Definition 2.2 ( [23H25] ). A heterogeneous system P is a set of m heteroge- 
neous processors P = {pi,P2, ■ ■ ■ ,Pm} whose communications are described 
by a network topology. If T = (V, E) is a task graph, then the execution 
time of Vi & V on pj G P is the function u : V x P ^ Q+. A heterogeneous 



system P is said to be consistent if processor p^ & P executes a task n times 
faster than processor py G P, then it executes all other tasks n times faster 
than processor py. 

Let T = (V, E) be a task graph and P = {pi,P2, ■ ■ ■ ,Pm} be a heteroge- 
neous system. A startup cost of initiating a task on a processor is assumed 
to be negligible. In a consistent system, the computation cost of task Vg on 
Pt is defined by u{vs,Pt) = r{vs)/c{pt), where r{vs) is the computation or 
resource requirement of task Vg, and c{pt) is the execution rate or capacity 
of processor pt pU |26] . In an inconsistent system, which is a generalization 
of a consistent system, the computation cost of task Vg on pt is defined by 
u{vs,pt) = Wst in which Wst is the corresponding (s,t)*'^ entry in a \V\ x \P\ 
cost matrix W [21]. In the remainder of this paper, we assume that every 
2D-MS is consistent. 

Definition 2.3 ([271 [28j). A 2D mesh-connected system consists oi m x n 
processors structured as a rectangular grid of height m and width n. Each 
processor is addressed by its coordinate {i,j) ioi 1 < i < m and 1 < j < n. 
An internal processor {x,y), where 1 < x < m and 1 < y < n, is directly 
connected to its four neighboring processors (x — l,y), (x -|- l,y), {x,y — 
1), {x,y + 1). A processor in the four corners has two neighboring pro- 
cessors, while a processor in the remaining boundary has three neighboring 
processors, respectively (see Fig. [SJ^d) in Section |4]). 

Definition 2.4. A hierarchical 2D mesh- connected system oi m x n hetero- 
geneous processors is a heterogeneous 2D mesh-connected system with the 
following partial order: 

P{i - 1, j) ^p Pii,j), Pit, J - 1) ^p P{t,j), l<i<m, Kj <n, 

where P{a, h) -<p P{c, d) means that a processor addressed by (a, h) has a 
higher priority (or execution rate) than a processor addressed by (c, d). 

Definition 2.5 ([914TT]). Let T^ = {^1,^2, • • • ,^m} be a set of m tasks with 
or without precedence constraints and i?„ = {pi,p2,- • • ,Pn} be a set of n 
processors. Let ^4 be a task assignment function between T^ and i?„. Let 
^^(^4) denote the total execution time of processor p for the task assignment A 
and let fp{A) denote the total idle time of processor p for the task assignment 
A. The turnaround time of processor p for the task assignment A is the 

total time spent in the processor p for the task assignment A. Let tp{A) = 



tp{A)+tp{A) and t{A) = m.a.Xptp{A). We call t{A) the task turnaround time 
of the task assignment A. An optimal task assignment is the task assignment 
A' that minimize the task turnaround time: 

t{A') = ramt{A) = minmaxtplA) . 

A A p 

We next define the necessary conditions for being an optimal task assign- 
ment in a 2D-HMS. 

Definition 2.6. Let T^ = {1,2,..., m} be a set of m tasks having priorities 
represented by task IDs from 1 to m, where a lower task ID indicates a higher 
priorit}Q A total order relation -<t is defined on Tm as follows, ti -<j t2 for 
any two tasks ti G Tm and ^2 £ T^ means that ti has a higher priority than t2- 
Let Rn be a 2D-HMS consist of ra (ra > m) heterogeneous processors whose 
priorities (or execution rates) of rows and columns are sorted in descending 
order. Let A be a bijective, priority-based task assignment between T^ and 
m processors in i?„. The necessary conditions for being an optimal task 
assignment between Tm and m processors in i?„ are defined as: 

(i) Priorities of the assigned tasks on processors strictly decrease in rows 
and columns, i.e., ti -<t t2 whenever A{ti) -<p A{t2) for any two tasks 
ti e Tm and ^2 e Tm- 

(ii) Processors of A{Tm) in i?„ is left-justified, where the row sizes of A{Tm) 
are weakly decreasing. 



The first condition in Definition 2.6 ensures that a processor with a higher 



priority executes a task with a higher priority. The second condition in 



Definition 2^ ensures that if one processor is idle and the other processor is 
busy for two adjacent processors in a 2D-HMS, the processor with the lower 
priority is chosen to be idle. We say that a task assignment or reassignment A 
in a 2D-HMS is necessarily optimal if it satisfies the necessary conditions for 
being an optimal task assignment in a 2D-HMS. However, these conditions 
are not universal for task assignments in 2D-MSs in that they depend on 
priority schemes and system characteristics. We focus on priority-based task 
assignments in a 2D-HMS in which an optimal task assignment for priority- 



based tasks satisfies the necessary conditions in Definition 2.6 Now we define 



the priority-based task reassignment problem in a 2D-HMS. 



^In some priority schemes |24| 129] . a higher number indicates a higher priority. 
Throughout this paper, it is assumed that a lower number indicates a higher priority. 



The priority-based task reassignment problem in a 2D-HMS is to find a 
necessarily optimal task reassignment sequence {Ak)^SQ for the given initial 
task assignment A = Aq along with the task completion sequence (fcj)^i, 
where 6j G T^ for 1 < i < m. The constraints and assumptions that we have 
made are as follows: 

(1) Both tasks and processors are heterogeneous. 

(2) Each processor can process at most one task at a time. Processors are 
dedicated and consistent to the priority-based task assignment, where 
no other task is processed when the priority-based task assignment is 
executed. 

(3) Each priority-based task reassignment is achieved by an iterative se- 
quence of task relocations, where each task relocation is allowed between 
two adjacent processors in a 2D-HMS if one processor is in idle state and 
the other node is in busy state. 

(4) Two task relocations do not occur simultaneously in a 2D-HMS. 

(5) Task relocation and communication costs are ignored. 

(6) The number of tasks are less than or equal to the number of the available 
processors in a 2D-HMS. Otherwise, a clustering [26] of tasks needs to 
be employed as an intermediate step, which is beyond the scope of this 
paper. 

Most of the traditional approaches [31 HI El |271 EH] are based on the 
assumptions that processors in a 2D-MS are homogeneous. Further, the 
topology of the assigned tasks on a 2D-MS is restricted (e.g., rectangular 
or square-mesh shape) and priority-based task assignments have not often 
been considered [21 II]. Thus, there is a lack of systematic mechanisms of 
task relocations in a 2D-HMS. We use tableaux and their algorithms for the 
priority-based task reassignment problem in a 2D-HMS. When representing 
a 2D-MS or a 2D-HMS, there is also a tradeoff between using a Young dia- 
gram (or a tableau) versus a 2D mesh graph. Since a Young diagram or a 
tableau is intended to represent a 2D mesh graph in a compact manner, it 
does not efficiently express communication costs in a 2D-HMS. If we ignore 
communication costs, a 2D mesh graph has an equivalent form of a Young 
diagram or a tableau. In the next section we give an introduction to tableaux 
and their algorithms. 



3. Tableaux and their algorithms 

A tableau is one of the essential tools to study combinatorics [211 130] , 
representation theory [201 122], symmetric functions [IHl EI], etc. We first 
introduce the required definitions of tableaux and their combinatorial algo- 
rithms. 

Definition 3.1 ([19]). A partition of n is defined as a sequence 
A = (Ai, A2, . . . , Aj), where the A^ are weakly decreasing and ^^^^ A^ = n. 
If A is a partition of n, then we write A h n. 

Definition 3.2 ( [ISl US [22] ) • Let A = (Ai, A2, . . . , A^) h n. A Young diagram 
(or Ferrers diagram) of shape A is a left-justified, finite collection of cells, with 
row j containing Xj cells for 1 < j < i. 

Definition 3.3 ([IS]). Let A be a partition. An inner corner of the Young 
diagram of shape A is a cell {i,j) € A whose removal leaves the Young diagram 
of a partition. An outer corner of the Young diagram of shape A is a cell 
{i,j) ^ A whose addition results in the Young diagram of a partition. 

Definition 3.4 ( [IHl [221 132] ) . Let A h n. A tableau t of shape A is a Young 
diagram of shape A filled with a set of elements, often positive integers. A 
Young tableau T of shape A is a tableau of shape A whose entries are the 
numbers from 1 to n, each occurring once. 

Definition 3.5 ([22j). A standard Young tableau is a Young tableau whose 
entries are strictly increasing in rows and columns. 

Definition 3.6 ([19]). Let A h n. A partial tableau p of shape A is a tableau 
whose entries are strictly increasing in rows and columns. 

Note that a partial tableau p in Definition |3.6| is the standard Young 
tableau if the entries of p are exactly {1,2,..., n}. 



375] 

2 . We see that ti is standard. 



Example. Consider ti = 2 ^ \ ^2 = 

but t2 is not. 

The number of standard Young tableaux of the given shape A is obtained 
from the hook formula. 



Definition 3.7 ([201 I22])- li i' = {i,j) is a cell in the Young diagram of 
shape A, then the hook of p, denoted H^,, is the set of all cells directly to the 
right of z/ or directly below u, including u itself: 



H. 



Hi,j 



{i^,J')■■J'>J}U{i^,J)■.^'>^}. 



The hook length of z/ = {i,j), denoted htj, is the number of cells in its hook, 



I.e., 



h 



«j 



H., 



«ji 



Theorem 3.1 ([33j). If X \- n, then the number f^ of standard Young 
tableaux of shape A is 



f 



n\ 



ll(ij)GA"'J.i 



D 



Example. Labeling each cell with its hook length for the Young diagram of 
shape (3, 2, 1) and (4, 4, 4, 4) are given below: 
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If the shape is A = (3, 2, 1), then /(3'2.i) = 6!/(5 ■ 3^ ■ l^) = 16. If the shape 
is A = (4, 4, 4, 4), then /(^AM) = xgi/^y . 6^ ■ 5^ ■ 4^ ■ 3^ ■ 2^ ■ 1) = 24024. 

Definition 3.8 ([HI |2T]). Let ^ = (/ii,/i2, • • • ,/ij) and A = (Ai, A2, . . . , A-,) 
be partitions with /x C A (i.e., i < j and /ifc < A^ for 1 < k < i). Then a 
s/i;ew shape of A//i is the set of cells 

A//i = {c : c G A and c ^ /i}. 

A skew shape of A//i is normal if /i = 0. A tableau of skew shape A//i is 
called a skew tableau of shape A//i. A partial skew tableau of shape A//i (or 
a partial tableau of skew shape A//i) is a skew tableau of shape A//i whose 
entries are strictly increasing in rows and columns. A partial skew tableau 
is called the standard skew tableau if its entries are precisely {1, 2, . . . , n}. 

Example. Consider skew tableaux of shape X/fi = (4, 3, 3, 2)/(2, 2) given by 
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We see that ti is a partial skew tableau, but ^2 is not. 



Definition 3.9 ( 

binary operation 



). A group (G, ■ ) is a nonempty set G, closed under a 
such that the following axioms are satisfied: 



(i) {a ■ b) ■ c = a ■ {b ■ c) for all a,b,c E G, 

(ii) There is an element e E G such that for alla;GG, e ■ x = x ■ e = x, 
(iii) For each element a E G, there is an element a^^ G G such that a-a~^ = 
a~^ ■ a = e. 

Definition 3.10 ([35j). The group of all bijections /„ — )■ In, where /„ = 
{1,2,..., n}, is called the symmetric group on n letters, denoted as ©„. 

Since &n is the group of all permutations of /„ = {1,2, ... ,n}, the order 
of &„ is n! E 



Definition 3.11 (|3ll [35]). Let 11,12, ... ,in be distinct elements of /„ = 
|1, 2, . . . , n\. Then ( • • ' ' ' V'] = \iii2---in\^ &n denotes the per- 
mutation that maps 1 \-^ ii,2 \-^ i2, . . . ,n ^^ in. 

Now we discuss combinatorial algorithms of tableaux, including row in- 
sertion, Robinson-Schensted, and jeu de taquin. We first describe the row 
insertion algorithm. The row insertion or row bumping algorithm [20] takes 
a partial tableau P, and a positive number x that is not in P. Then, it 
constructs the new partial tableau P'. The procedure of row insertion is 
described by Algorithm [T| which always yields a partial tableau. Fig. [Tj de- 
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Fig. 1. An example of row insertion. 



scribes the procedure of row-inserting x = 5 into P. Entries that are bumped 
during the row insertion are indicated by boldface type in Fig. [1} The row 
insertion procedure is always reversible. If we have P' along with the address 
of the added cell in P', we can restore the original partial tableau P. The 
procedure of reverse bumping [2D] is described in Algorithm [2J 



Algorithm 1: Row insertion (Row bumping) fi9[ |20] 
Input: a partial tableau P and an input number x that is not in P 
Output: a partial tableau P' {x is inserted into P) 
begin 

set i := 1 as the first row of P; 

while X is less than some element of row i do 

let y be the smallest element of row i that is greater than x; 

replace y by x; set x := y and i := i + 1; 

end 

place X (with its new cell) at the end row of i; / / x is the largest 

element of row i 
end 



Algorithm 2: Reverse bumping [20j 



Input: a partial tableau P' and number x = Plj in P' (cell {i,j) has 
to be an outer corner in the underlying Young diagram of P') 
Output: a partial tableau P 
begin 

set x' := X and k := i — 1; 

remove the cell of x (including x itself) in P'; 

while A; 7^ do 

let y be the largest element of row k that is smaller than x'; 
replace y by x'; set x' := y and k := k — 1; 
end 
end 
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Fig. 2. An example of reverse bumping. 
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P. 



The address of the added cell in P' in Fig. [T] is (3, 3) having an entry 9. 
Fig. |2] shows the procedure of reverse-bumping x = 9 to P'. Note that it 
restores the original partial tableau P in Fig. [T] 

We next describe the Robinson-Schensted algorithm. The Robinson- 



Schensted algorithm [121 ES] provides a bijection between elements of the 
symmetric group &n and pairs of standard Young tableaux of the same 
shape X \- n. Recall that [ii 12 ■ ■ ■ in] £ &n denotes the permutation that 
maps 1 I-)- ii,2 1— )■ Z2,...,n 1— )■ in- For each tt = [ii Z2 ■■■in] G &n, the 
Robinson-Schensted algorithm constructs a sequence of Young tableaux pairs 
(0,0) = iPo,Qo),{Pi,Qi),...,{Pn,Qn) = iP,Q) (see Algorithmic. By the 
row insertion algorithm, ii,i2, ■ ■ ■ ,in are inserted into the P's. Meanwhile, 
numbers 1,2, ...,n are simply placed sequentially to the Q's so that the 
shape of Pk and the shape of Qk are the same for 1 < A; < n [19]. We call 
P the insertion tableau and Q the recording tableau of n, denoted -P(7r), and 
(5(vr), respectively [T^ I 



Algorithm 3: Robinson-Schensted algorithm (vr — t- {P,Q)) [13 



Input: TT = [xi X2 ■ ■ ■ a;„] G ©„ 

Output: P = Pn and Q = Qn (the insertion tableau and the 

recording tableau of n, respectively) 
begin 

set(Po, Qo):=(0,0); 
for fc ■(— 1 to 72 do 

construct Pk by inserting Xk into Pk-i] place /c into Qk-i so 

that the shape of Pk and Q^ are the same; 
end 

return (P„, Qn); 
end 



Note that the insertion tableau P and the recording tableau Q oi tt = 
[h ^2 ■ ■ ■ in\ ^ ©n are both standard Young tableaux of the same shape. The 
procedure of Algorithm [s] is reversible, i.e., {P,Q) — t- tt. Given {Pk-,Qk), we 
retrieve the fcth element of vr (i.e., ik) from (P^, Qk)-, and find Pfc_i with the 
ifc removed by using the reverse bumping algorithm (see Algorithm |2]) . We 
easily find Qk^i from Qk by removing k (along with its cell) from Qk- Thus, 
given (P, Q) = {Pn, Qn), we construct a sequence of standard Young tableaux 
pairs (P„, Qn), (Pn-i, Qn-i), - - - , {Po, Qo) = (0, 0), where all the elements of 
TT G &n are recovered in reverse order [19]. The theorem about the Robinson- 
Schensted algorithm is as follows. 

Theorem 3.2 ([211 EB]). The Robinson-Schensted algorithm establishes a 
bijection between elements of &n and pairs of standard Young tableaux of the 

10 



same shape X\- n. In other words, "^xhri (/^) ~ ""-'' where f^ is the number 
of standard Young tableaux of shape X\- n. D. 

Note that the bijection in the Robinson-Schensted algorithm is given 
by TT — 7- (P, Q) rather than tt — > P. Each insertion tableau P of vr = 
[xi X2 ■ ■ ■ Xn] G ©n corresponds to an equivalence class of 6„, called a P- 
equivalence class. 

Definition 3.12 ([19]). Two permutations vr = [xi X2 ■ ■ ■ a;„] G 6„ and 
''' = [yiy2 ■ ■ ■ y'n\ G ©« are called P-equivalent if P(vr) = P{t), denoted 
n =pr. 

Example. Both [213] G 63 and [2 31] G ©3 have the same insertion tableau, 
i.e., [213] ^p [2 31]. Similarly, [13 2] G 63 and [312] G 63 have the same 
insertion tableau, i.e., [1 3 2] =p [3 1 2]. 

Now we discuss Knuth- equivalence classes of 6„ and their relationship to 
P-equivalence classes of ©„. 

Definition 3.13 (|2T]). Suppose x < y < z. A Knuth transformation of a 
permutation vr G &n is a transformation of vr G ©„ into another permutation 
T & Gn that has one of the following forms: 



(ii) vr = [xi 
(iii) vr = [xi 
(iv) vr = \xi 



■ yXZ ■■■ Xn]eGn =^ T =[xi ■■■ yZX ■■■ Xn] e&n, 

■ yZX ■■ ■ Xn] e&n =^ T = [Xi ■ ■ ■ y X Z ■ ■ ■ Xn] e ©n, 

■ XZy ■■ ■ Xn]eGn =^ T = [Xi ■ ■■ ZXy ■■■ Xn] e &n, 

■ ZXy ■■ ■ Xn] e&n =^ T = [Xi ■ ■■ XZy ■■■ Xn] e &n- 



Two permutations it,t & (5„ are called Knuth- equivalent if one of them can 
be obtained from the other by a sequence of Knuth transformations, denoted 

vr =K T. 

Example. We see that [213] G ©3 and [2 31] G ©3 are Knuth-equivalent by 
(g and ^ in Definition [3l3| written [2 1 3] =k [2 31]. Similarly, [1 3 2] G 6 



and [312] G ©3 are Knuth-equivalent by (iii) and divj) in Definition 3.13 
written [13 2] =/,- [312]. 

From the preceding examples, we observe that some permutations in ©3 
are Knuth-equivalent if they are P-equivalent, and vice versa. More generally, 
P-equivalence classes and Knuth-equi valence classes coincide in ©„. 
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Theorem 3.3 ( PTl [50]). Permutations in (3„ are Knuth- equivalent if and 
only if they are P- equivalent. D 

Definition 3.14 ([IHIEI])- Let t be a tableau. The reading word or row word 
of t, denoted r(t), is the permutation of entries of t obtained by concatenating 
the rows oft from bottom to top, i.e., r(t) = RkRk-i ■ ■ ■ Ri, where Ri, . . . ,Rk 
are the rows of t. 

Lemma 3.1 (|2I])- If '^ ^■^ the reading word of a standard Young tableau T, 
thenT zs P{tt). D 

The jeu de taquin (or the "teasing game") of Sciitzenberger [211 EH con- 
sists of a set of rules for transforming partial tableaux, while some properties 
of partial tableaux are preserved during transformations. There are two kinds 
of jeu de taquin slides, which are a forward and a backward slide. A forward 
jeu de taquin slide is described in Algorithm |4| Note that the resulting 
tableau of a forward jeu de taquin slide is still a partial tableau. 



Algorithm 4: A forward jeu de taquin slide [TOl [27 



Input: a partial tableau P of skew shape A//i; an inner corner of /x 

Output: a partial tableau P' 

begin 

pick X to be an inner corner of fi; 
while X is not an inner corner of A do 
if X = {i,j) then 

let x' be the cell of min{Pj+i j , Pj j+i}; 
// if only one of Pi+ij and Pij+i exists, then choose that 
value as a minimum; 
end 

slide Pre' into cell x and set a; := x'; 
end 

return the partial tableau P'; 
end 



A backward jeu de taquin slide can be an invertible operation of a forward 
jeu de taquin slide by performing the backward slide into the cell that was 
vacated by the forward slide. A backward jeu de taquin slide is described in 
Algorithm [5j 
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Algorithm 5: A backward jeu de taquin slide [19 
Input: a partial tableau P of skew shape A//i; an outer corner of A 
Output: a partial tableau P' 
begin 

pick y to be an outer corner of A; 
while y is not an outer corner of ix do 
if y = {i-ij) then 

let y' be the cell of max{Pj_ij , Pjj_i}; 
//if only one of P, 
value as a maximum; 
end 

slide Py' into cell y and set y : 
end 

return the partial tableau P'\ 
end 



1 j and Pjj-i exists, then choose that 
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Example. Consider the following jeu de taquin slides for x = (1, 1) and 
y = (2,4)inP: 



P: 



If we denote the forward jeu de taquin slide on P from cell x = (1, 1) in 
P as jdt^(P), and backward jeu de taquin slide on P from cell y = (2,4) 
in P as jdty(P), then both jdt'^(P) and jdtj^(P) are still partial tableaux. 
Furthermore, if we let cell (3, 3) of jdt''(P) as z, we have jdt^(jdt^(P)) = P, 
which restores P. 



Definition 3.15 ([2T]). Partial tableaux P and P' are called jeu de taquin 
equivalent, written P =jdt P', if P' can be obtained from P by some sequence 
of jeu de taquin slides, or vice versa. 

Theorem 3.4 ([191 EZ])- The jeu de taquin equivalence class of a given 
partial skew tableau P contains exactly one partial tableau of normal shape, 
denoted j{P). D 



Theorem 3.4 says that jeu de taquin slides bring a partial tableau P of 



skew shape to a partial tableau j(P) of normal shape. Recall that a skew 
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shape A//i is a normal shape if /i = 0. The following results show that the jeu 
de taquin equivalence relation is closely connected to both the P-equivalence 
and the Knuth-equivalence relation. 

Lemma 3.2 (|2I])- Each jeu de taquin slide converts the reading word of a 
standard skew tableau into a Knuth- equivalent one. D 

Theorem 3.5 (|2T1|37]). Let P and P' he standard skew tableaux. They are 
jeu de taquin equivalent, i.e., P =jdt P' , if and only if their reading words 
are Knuth- equivalent, i.e., r{P) =k r{P'). 



Proof. P ^^at P'^jiP) = Jin (by Theorem 3.4) ^ r{j{P)) ^k rUiP')) 



(by Lemma 3.1 and Theorem 3.3) <^ r{P) =k r{P') (by Lemma 3.2). D 



4. 2D mesh graphs and tableaux 

Recall that G' = {V',E') is a subgraph of G = {V,E) if ^ C V and 
E' G E pJ8]- A vertex-induced subgraph G' is the subgraph of G that contains 
all edges of G that join any two vertices in V of G' |3Hl [39]. In this section 
we present a certain type of vertex-induced subgraphs of 2D mesh graphs 
and their relationship to tableaux. 

Definition 4.1. A 2D mesh graph G of shape A = (Ai, A2, . . . , A^) h ra is a 
left-justified, a vertex-induced subgraph of a fc x Ai 2D mesh graph, where 
each row contains Aj vertices for 1 < i < A;. A 2D mesh graph G = (V, E) 
of shape A is directed if all edges of G are directed in such a way that the 
horizontal edges are directed from left to right and the vertical edges are 
directed from top to bottom. 

Similarly to the traditional 2D mesh graphs, each vertex of 2D mesh graph 
G = (y, E) of shape A = (Ai, A2, . . . , A^) h n is represented by its coordinate. 
For example, a top-left corner vertex is (1, 1) while a bottom-right corner 
vertex is (/c, \k). If Ai = A2 = ■ ■ ■ = A^ for shape A = (Ai, A2, . . . , A^) h n, we 
say that A is of canonical shape. 

Definition 4.2. Let Gi = {Vi,Ei) be a 2D mesh graph of shape A and 
G2 = (V2, E2) be a 2D mesh graph of shape fi, where fi = (/ii, /i2, • • • , /^i) C 
A = (Ai, A2, . . . , Xj), i.e., i < j and /ifc < Ajt for 1 < A; < i, respectively. A 
2D mesh graph G' of skew shape A//i is the vertex-induced subgraph of Gi, 
where the vertex set of G' consists of Vi — V2. 
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Fig. 3. Conversion of a 2D mesh graph into a Young diagram or a Young tableau. 



For example, the upper figures in Fig. [sFa), (b), and (c) show 2D mesh 
graphs of shape Ai = (4,4,4,4), A2 = (4,3,2,2), and A3 = A//x where 
A = (4, 4, 4, 4) and fi = (2, 2), respectively. Now we consider the relationships 
between 2D mesh graphs and Young diagrams (or tableaux). For example, 
if we do not take communication costs into account for a 2D mesh proces- 
sor graph, it has the compact form represented by a Young diagram or a 
tableau. In order to represent a 2D mesh processor or task graph of shape A 
in a compact manner, we define a 2D mesh Young diagram and a 2D mesh 
tableau. 

Definition 4.3. A 2D mesh Young diagram of shape A = (Ai, A2, . . . , Afc) h 
n is a Young diagram of shape X \- n, where each cell {i,j) corresponds to 
each vertex (^, j) in the 2D mesh graph G = {V, E) of shape X\- n. 

Definition 4.4. A 2D mesh Young diagram of skew shape X/fi is a skew 



Young diagram of shape A//i for /i C A. Similarly to Definition |4. 3 , each cell 
{i,j) corresponds to each vertex {i,j) in the 2D mesh graph G = {V,E) of 
skew shape X/fi. 

Fig. |3] shows how 2D mesh graphs are converted into 2D mesh Young 
diagrams. Fig. p[a) shows a 2D mesh Young diagram of canonical shape 
while Fig. Islc) shows a 2D mesh Young diagram of skew shape. We next 
define a 2D mesh tableau in order to represent a labeled 2D mesh graph (see 
Fig. id)). 
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Definition 4.5. A 2D mesh tableau of shape A = (Ai, A2, . . • , Xk) l~ n is a 
Young tableau of shape A h n, denoted as (MTjj) of shape A or A-(MTjj), 
where each cell has a unique number from 1 to n and its underlying Young 
diagram is a 2D mesh Young diagram of shape X\- n. 

Definition 4.6. A 2D mesh tableau of skew shape X/fi is a tableau of skew 
shape A//i, denoted as (MTj^) of shape A/yU, where each cell has a unique 
number from the set {1,2, ... ,n} and its underlying Young diagram is 2D 
mesh Young diagram of skew shape A//i. 

A graph-theoretic approach to Young diagrams or tableaux has already 
been researched [10] • It focuses on graphs having the shape of a Young dia- 
gram or a tableau, while this paper focuses on converting a 2D mesh graph 
into a Young diagram or a tableau. The relationship between a certain label- 
ing of directed acyclic graphs and the number of standard Young tableaux 
has been discussed in [HI |12] . The following propositions involve in counting 
aspects of representing 2D mesh task and processor graphs, respectively. 

Proposition 4.1. Let G = (y,E) be a directed 2D m,esh graph of shape 
\\- n. Let T = {1, 2, . . . ,n} be a totally ordered set of n tasks ordered by the 
"less than" relation <. Suppose we assign tasks T to vertices V bijectively 
such that task ID of vertex x is less than task ID of vertex y for each edge 
{x,y) G E. Then, the number of such assignments is the number of standard 
Young tableaux of shape \\- n. D 

Proposition 4.2. Let G = {V,E) be an undirected 2D mesh graph of shape 
X\- n. Let P = {1,2, . . . ,n} be a totally ordered set of n processors ordered 
by the "less than" relation <. Suppose we assign processors P to vertices 
V bijectively such that processor IDs of vertices are increasing in rows and 
columns. Then, the number of such assignments is the number of standard 
Young tableaux of shape X\- n. D 



Proposition |4.1| and |4.2| directly follow from the definition of a standard 
Young tableau of shape X \- n. Each labeled 2D mesh task and processor 
graph of shape X \- n corresponds to each 2D mesh tableau of shape X \- n. 
We call a 2D mesh task graph of shape X \- n satisfying the hypothesis of 



Proposition ^Tla standard 2D mesh task graph of shape X\- n, and a processor 



graph of shape X\- n satisfying the hypothesis of Proposition |4.2| a standard 
2D mesh processor graph of shape X \- n, respectively. Thus, counting the 
number of standard 2D mesh task or processor graphs of shape A h n are 
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reduced to counting the number of 2D mesh tableaux of shape X\- n whose 
entries in rows and columns strictly increase, i.e., /^ (see Theorem 3.1). 



Now consider a task assignment between a standard 2D mesh task graph 
of shape X \- n and a standard 2D mesh processor graph of the same shape 
by their coordinates, i.e., each task addressed by (z, j) of the 2D mesh task 
graph is assigned to each processor addressed by {i,j) of the 2D mesh pro- 



cessor graph. Proposition |4.3| discusses the counting aspects of such task 
assignments. 

Proposition 4.3. Let X \- n. The number of bijective task assignments by 
their coordinates between standard 2D mesh task graphs of shape X \- n and 
standard 2D mesh processor graphs of shape X\- n is (/'*') . Moreover, if we 
sum all the numbers of such task assignments for all possible partitions (or 
shapes) of n, then the total number is J2x\-n (/'^) ~ ''^'■ 



Proof. By Proposition AA_ and |4.2[ the number of standard 2D mesh task 
and processor graphs of shape A h n is both /'*'. Thus, for a given shape 
X \- n, the number of bijective task assignments by their coordinates be- 
tween standard 2D mesh task graphs of shape X\- n and standard 2D mesh 
processor graphs of shape A h n is (/'*') . For a given n, if we consider 
all possible partitions (or shapes) of n, then the t otal number of such task 

D 



assignments is X^Ai-n (/^) ' which is n! by Theorem 



3.2 



In case a 2D mesh graph represents a 2D mesh processor graph, we say 
that the associated 2D mesh Young diagram (respectively, 2D mesh tableau) 
is the 2D mesh processor diagram (respectively, 2D mesh processor tableau). 

Definition 4.7. A 2D mesh processor diagram of shape A h ra is a 2D 
mesh Young diagram of shape A h n, where each cell in the 2D mesh 
processor diagram represents each processor in the heterogeneous system 
P = {Pi: P2, ■ ■ ■ , Pn}- Each cell is assumed to have direct communication 
links with its neighboring cells and have a routing capability, allowing it to 
send and receive messages to and from any other cell in the diagram. 

To exploit the regular nature of a 2D mesh topology, we consider a hier- 
archical 2D mesh processor diagram of canonical shape, where the rows and 
columns of heterogeneous processors are sorted in descending order by their 
priorities (or execution rates). 
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Definition 4.8. A hierarchical 2D mesh processor diagram of canonical 
shape A = (Ai, A2, . . . , A^) for Ai = A2 = • ■ ■ = A^ is a 2D mesh proces- 
sor diagram consists of A; x Ai processors with the following partial order: 

P(z - 1, j) ^p P{i,j), Pit, J - 1) ^p P{i,j), l<t<k,l<j<\i, 

where P{a, h) -<p P{c, d) means that a processor addressed by (a, h) has a 
higher priority (or execution rate) than a processor addressed by (c, d). 

Proposition 4.4. Let A = (Ai, A2, . . . , A^) h n for Ai = A2 = • • • = A^, and 

let P = {1, 2, . . . , n} he a totally ordered set of k x Xi processors ordered by 
the "less than" relation <. Each processor ID represents a processor priority, 
where u < v for any two processors u E P and v E P implies that processor u 
has a higher priority (or execution rate) than processor v . Then, the number 
of ways to organize a hierarchical 2D mesh processor diagram of shape \\- n 
by arranging n processors in P is f^ . 



Proof. It immediately follows from Definition 3.5 and 4.8 D 



We define a hierarchical 2D mesh tableau in order to represent a priority- 
based task assignment and its reassignments in a 2D-HMS. 

Definition 4.9. A hierarchical 2D mesh tableau of shape A h n is a tableau of 
shape A, denoted as (HMTjj) of shape A or A-(HMTjj), whose underlying 2D 
mesh Young diagram is a hierarchical 2D mesh processor diagram of canonical 
shape \\- n. Entries of A-(HMTj.,) are task IDs from the set {1,2, . . . ,n} 



with the total order relation -<t (see Definition 2.6). Empty entry of a cell 
is allowed in A-(HMTj j), while all non-empty entries along with their cells 
must form a tableau of normal or skew shape, called a maximally embedded 
tableau of A-(HMTij). 

Definition 4.10. Let A-(IIMTj ,,) be a hierarchical 2D mesh tableau of shape 
\\- n. If the maximally embedded tableau of A-(HMTjj) is a tableau of nor- 
mal shape, we say that A-(IIMTjj) is of normal shape. Meanwhile, if the 
maximally embedded tableau of A-(HMTjj) is a tableau of skew shape, we 
say that A-(HMTj j) is of skew shape. If the maximally embedded tableau 
of A-(HMTj j) is a partial tableau (i.e., a tableau whose entries are strictly 
increasing in rows and columns), we say that A-(IIMTjj) is standard. Oth- 
erwise, we say that A-(IIMTjj) is generalized. 



In this paper A-(HMTj ,,•) is referred to as a standard A-(HMTj j) of nor- 
mal shape unless otherwise stated. Each entry in A-(HMTjj) denotes a 
task, where a lower task ID indicates a higher priority. Each cell repre- 
sents each processor in the hierarchical 2D mesh processor diagram. There- 
fore, A-(HMTj j) represents a priority-based task assignment in a 2D mesh- 
connected system. Note that a priority-based task assignment represented 
by A-(HMTjj) satisfies the necessary conditions for being an optimal task 



assignment in a 2D-HMS (see Definition 2.6). 



Definition 4.11. Let A-(HMTi ,,) be a hierarchical 2D mesh tableau of shape 
X \- n. If the processor priority P{a, b) is fixed for any cell (a, b) in A- 
(HMTjj), we say that A-(HMTjj) is of the fixed processor priority. 

Although the priorities of underlying processors in A-(HMTj .,■) have to 
decrease in rows and columns by Definition |4.8[ it is not required to specify 
the processor priority for each cell (or processor) in A-(HMTjj). Meanwhile, 
A-(IIMTj j) of the fixed processor priority (see Fig. iTta)) denotes the priori- 
ties of underlying processors in A-(IIMTj j), where the label in the top- right 
corner of each cell denotes the processor priority (cf. [18j). 

5. Priority-based task reassignments in a 2D-HMS 

Consider the priority-based task assignment in Fig. |4](a) represented by 
(HMTj j) of shape A = (3, 3, 3) \- n for n = 9. Assume A-(HMTi j) is not of 
the fixed processor priority. 
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Fig. 4. A sequence of task reassignments for a task completion sequence 
(1,3,2,5,8,4,6,7,9). 
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Algorithm 6: Task reassignments: A-(HMTjj) of normal shape 
Input: An initial task assignment Aq represented by A-(HMTjj) of 
normal shape; a task completion sequence (61, 62, • • • , &m), 
where m > 2, in runtime 
Output: A task reassignment sequence {Ak)^~Q 
begin 

set A-(HMtJJ) :=A-(HMTij); 
for A; f- 1 to ?7i — 1 do 

let t be the maximally embedded tableau of X-(RMTlJ) and yU 
be the shape of t; wait for a completion of task bk in 
A-(HMTj.y); if task bk is completed, then the cell (or processor) 



.(fc)^ 



becomes vacated; set x as the 



of task bk in A-(HMT,\^ 
corresponding cell in X-(HMTl'^-' 
while X is not an inner corner of fi do 
if X = {i, j) then 

let x' be the cell of min{HMTj+\^^. , HMtJ^VJ // if only 
one non-idle cell (or processor) exists in [i + 1, j) and 
{i,j + 1), then choose that cell; 
end 

relocate the task on cell x' into cell x and set x := x'; 
end 

set Afc:=A-(HMTg^), where A-(HMTg^) is of normal shape; 
set A-(HMTf +')):=A-(HMT;*;)); 



end 



return the task reassignment sequence [Ak] 



k=0 ' 



end 



If task 1 in Fig. |4]^a) is completed first, the processor addressed by (1, 1) 
becomes idle. We mark the cell (1,1) as • to show that the processor ad- 
dressed by (1, 1) is now in the idle state. Once a processor is in the idle state, 
it seeks the right and below processor to perform task relocation. Recall that 
task relocation is only allowed between two adjacent processors if one is in 
the idle state and the other is in the busy state. If a processor is in the idle 
state, it does not check the left and above processor to perform task reloca- 
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tion. It is because the underlying 2D mesh processor diagram of A-(HMTj ,,•) 
is hierarchical, it is not an optimal choice if a task is to run on a processor 
with the lower execution rate. Thus, a processor in the idle state always seeks 
both the right and below processor in order to compare task priorities and 
to relocate a task. We see that task HMTi 2 has a higher priority than task 
HMT2,i, i.e., 2 -<t 3. Thus, task relocation involves the processor addressed 
by (1, 1) and the processor addressed by (1, 2). Now task 2 has been relocated 
and the processor addressed by (1, 2) becomes idle. If a processor becomes 
idle, the choice for task relocation between the right and below processor is 
always greedy |13], allowing the task with the higher priority to occupy the 
idle processor (see Fig. |4](b)). This process continues until no task relocation 
is possible, which means that the right and below processor are both idle 
or both not available. The final state of Fig. |4Fb) is the task reassignment 
Ai for the completion of task 1. Given the initial task assignment Aq in 
Fig. 0(a), Fig. nc) shows the task reassignment sequence (A^)^^ for m = 9 
corresponding to the task completion sequence (1, 3, 2, 5, 8, 4, 6, 7, 9). Al- 
gorithm [6] describes the procedure in Fig. |4j Task reassignments take place 
in Algorithm [6] if a task completion sequence is provided in run time. It turns 
out that the iterative greedy task relocation mechanism in Algorithm |6] cor- 
responds to the forward jeu de taquin slide discussed in Algorithm |4| except 
that task relocation starts with the cell that is indicated by the task com- 
pletion sequence. Note that each task assignment in a task reassignment 
sequence (y4fc)^Q^ in Algorithm |6] satisfies the necessary conditions in Defi- 



nition 



2.6 



We see that each task assignment in {Ak)^^Q in Algorithm k\ is 
represented by a hierarchical 2D mesh tableau of normal shape, where task 
IDs are increasing in rows and columns on the underlying hierarchical 2D 
mesh processor diagram. Therefore, (^a:)^^^ in Algorithm p^ is a necessarily 
optimal task reassignment sequence. 

Thus far, we have examined the case where the initial task assignment is 
represented by A-(IIMTjj) of normal shape. Now we consider the case where 
the initial task assignment is represented by A-(IIMTj ,,■) of skew shape. 
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Fig. 5. Task reassignments by using forward jeu de taquin slides. 
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Algorithm 7: Task reassignments: A-(HMTjj) of skew shape 
Input: An initial task assignment Aq represented by A-(HMTj ,,■) of 

skew shape 
Output: A task reassignment sequence {Ak)]^^Q 
begin 

if /3 is a partition of m' for the maximally embedded tableau of 
A-(HMTjj) of skew shape a//3, then set m as the value of m'; set 
A-(HMTSy) :=A-(HMTij); set k := 1; 

while the maximally embedded tableau of \-{HMTlJ) is not of 
norm,al shape do 

if the maximally embedded tableau of A-(HMT,- y) is of (skew) 
shape /i/z/, pick x to be an inner corner of v\ 
while X is not an inner corner of fi do 
if a; = {i, j) then 

let x' be the ceU of min{HMT2\^j. , HMXg^ J // if only 
one non-idle cell (or processor) exists in {i + 1, j) and 
{i,j + 1), then choose that cell; 
end 

relocate the task on cell x' into cell x and set x := x'; 
end 

set v4,,:=A-(HMTg)); set A-(HMTg+'^):=A-(HMTg^); 
k := k + 1; 
end 
return the task reassignment sequence {Ak)^^Q, where Am is the 



task reassignment represented by X-(HMTi^ ) of normal shape; 



end 



Consider a top-left corner of a hierarchical 2D mesh tableau ti or ^2 in 
Fig. |5| where the processor with the highest priority is idle. Thus, it is a 
natural choice to relocate a task from a processor with the lower execution 
rate to a processor with the higher execution rate. Algorithm [7] describes the 
procedure in which the initial task assignment, represented by a hierarchi- 
cal 2D mesh tableau of skew shape, is converted into the task reassignment, 
represented by a hierarchical 2D mesh tableau of normal shape. As shown in 
Fig. [5} ^3 is the maximally embedded tableau of ti, and ^4 is the maximally 
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embedded tableau of t2, respectively. By performing task relocations dis- 
cussed in Algorithm [7] iteratively, the task reassignment ts is obtained from 
the task assignment ti in Fig. |5| Similarly, the task reassignment t^ is also 
obtained from the task assignment ^2 in Fig- [5j If ^2 represents an initial task 
assignment Aq, then t^ corresponds to the task reassignment ^3. According 
to Algorithm [7], the initial choice of task relocation for Ai must target for 
either the cell (1, 2) or the cell (2, 1) in t2- Note that the task reassignment se- 
quence (Aq, ^1,^2, A3) is not uniquely determined, depending on the choice 
of an initial task relocation. However, A3 is uniquely determined by The- 
orem 3.4, since the greedy-based task relocations on A-(HMTj.,) follow the 
forward jeu de taquin slide rules. Unlike a task reassignment represented by 
A-(HMTjj) of normal shape, a task reassignment represented by A-(HMTjj) 
of skew shape do not always satisfy the necessary conditions for being an 
optimal task assignment. For example, non-idle processors of Aq, Ai, and A2 
are not left-justified, which implies that the processor with the higher execu- 
tion rate is idle for some pairs of adjacent processors in a 2D-HMS. However, 
the resulting task reassignment A^ satisfies the necessary conditions for being 
an optimal task assignment in a 2D-HMS. Note that this process is reversible 
by using the backward jeu de taquin slides as discussed in Algorithm [5j Now 
we define an equivalence class of task reassignments up to task relocations 
under the greedy task relocation policy. 

Definition 5.1. Two task assignments ti, represented by a hierarchical 2D 
mesh tableau A-(HMT^j) of skew shape, and ^2, represented by a hierarchical 
2D mesh tableau A-(HMT? •) of skew shape, are called task reassignment 
equivalent up to task relocations (under the greedy task relocation policy 
described in Algorithm [T]), denoted as ti =t ^2, if they have the same resulting 
task reassignment, represented by the same A-(HMTj j) of normal shape. 

Proposition 5.1. Let \-{HMT]j) and X-{HMT^.j) be hierarchical 2D mesh 
tableaux of skew shape whose maximally embedded tableaux are both standard 
skew tableaux. If two task assignments ti, represented by \-{HMT\j), andt2, 
represented by \-(HMTf ■) , are task reassignment equivalent, then the reading 
words of their maximally embedded tableaux are both Knuth- equivalent and 
P-equivalent. 



Proof. Let P be the maximally embedded tableau of A-(HMTj^ ) for the task 
assignment ti, and Q be the maximally embedded tableau of A-(HMT? ■) for 
the task assignment ^2- Since ti =t ^2 by hypothesis, ti and ^2 have the same 
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resulting task reassignment represented by the same A-(HMTj j) of normal 

Let A^ be the maximally embedded tableau of such 
of normal shape. Each task relocation under the greedy task 



shape by Definition |5.1 

A-(HMTij 

relocation policy described in Algorithm [7] follows the forward jeu de taquin 

slide rules described in Algorithm 111 Thus, P =jdt Q by Theorem 13.4 



Since 

P and Q are both standard skew tableaux by hypothesis satisfying P =jdt Q, 
we conclude that the reading words of P and Q are both Knuth-equivalent 
and P-equivalent by Theorem 3.3 and 3.5 D 
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Fig. 6. Insertion tableaux for the reading word of is and the reading word of t^ 
in Fig. I5j 

For example, the reading word of ta (i.e., the maximally embedded tableau 
of ti) in Fig. [5]is7r= [78235416] G &s- Similarly, the reading word of t^ 
(i.e., the maximally embedded tableau of ^2) in Fig. Islis r= [78253416] G 

we see that tt and r are Knuth-equivalent by 
3, y = 4, and z = 5. Fig. |6] shows the procedure for 
constructing insertion tableaux for both vr and r. We also see that vr and r 
are P-equivalent. 



©g. By Definition 3.13 iii) 
the choice of x 



6. Application 

This section presents an application of priority-based task reassignments 
in a 2D-HMS. We first introduce the sequential task assignment problem in 
a 2D-HMS. 

The sequential task assignment problem in a 2D-HMS is defined as fol- 
lows. A set of m tasks Tm = {1,2,..., m} with the precedence relationship 
1 —)■ 2 —)■■■■—!• m are to be assigned to a set of m heterogeneous processors 
in a 2D-HMS of shape X \- n {m < n) and executed sequentially without 
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gaps. We assume that each task is assigned to each processor in a 2D-HMS 
bijectively, where a 2D-HMS is dedicated for a sequential task assignment. 
Tasks are heterogeneous, and their priorities are assigned by their resource 
requirements in which the higher task priority indicates the larger resource 
requirement. Find a bijective task assignment between T^ and a set of m 
processors in the 2D-HMS of shape X \- n {m < n) in such a way that the 
task turnaround time is minimized. 
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Fig. 7. Comparisons of task reassignments. 



By Definition 2.6, a sequential task assignment represented by A-(HMTjj) 



of normal shape satisfies the necessary conditions for being an optimal task 
assignment in a 2D-HMS. Further, an optimal task assignment is represented 
by A-(HMTj j) of the fixed processor priority in which task ID and proces- 
sor ID are the same for each cell (see Fig. [TJ^a)). Unless a sequential task 
assignment is represented by A-(HMTjj) of the fixed processor priority, we 
only concern the necessary conditions for being an optimal task assignment. 
We define a descent pair of a generalized A-(HMTj j), which breaks the first 
necessary condition for being an optimal task assignment in a 2D-HMS (see 



Definition 2.6) 



Definition 6.1. Let A-(HMTj j) be a generalized hierarchical 2D mesh tableau. 



Suppose HMTij ^t HMTi_ij (respectively, HMT.j ^t HMT 
{{i-^:j):{hj)} (respectively, {{i,j-l] 
generalized A-(HMTj j). 



ij_i). Then, 
, j)}) is called a descent pair of a 



If a descent pair occurs in a sequential task assignment represented by 
a generalized A-(HMTj^j), it always has the better task assignment in terms 
of task turnaround time. For example, swapping tasks on a descent pair 
reduces task turnaround time for a sequential task assignment in a 2D-HMS. 
It follows that if a sequential task assignment cannot be represented by (a 
standard) A-(HMTjj), it is not a necessarily optimal task assignment. We 
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see that the task assignment represented by Fig. JTJ^b) has a descent pair in 
a generahzed A-(HMTjj), i.e., HMTi^2 -<t HMTi^i. Now consider priority- 
based task reassignments for a sequential task assignment in a 2D-HMS. We 
see that the task completion sequence is simply (1,2,..., m) for a sequential 
task assignment of m tasks. Note that a greedy task relocation policy keeps A- 
(HMTj j) from occurring any descent pair for a sequential task reassignment. 



If we assume that every 2D-HMS is consistent (see Definition 2.2) in which 



task relocations are cost-free, we have Proposition 6A_ 

Proposition 6.1. Let Aq be a sequential task assignment for m {m > 
2) tasks represented by X-{HMTij) of normal shape. Let Ti be the task 
turnaround time for Aq without task relocation, and T2 be the task turnaround 
time with the task reassignment sequence {Ak)^^Q (see Algorithmlm. Then, 
T2 is less than Ti, i.e., T2 < Ti. 

Proof. It suffices to show that each task relocation allows each task to re- 
duce its task execution time for a sequential task assignment in a 2D-HMS. 
Let ti^x be the task execution time of task i on processor (or cell) x in A- 
(HMTj j) before task relocation. After task relocation, task i is relocated 
from processor x to its adjacent processor x' such that P(x') -<p P(x), where 
P{x') -<p P{x) means that an execution rate of processor x' is higher than 
that of processor x. Thus, ti^^' < ti^x- Since each task reassignment consists 
of iterative task relocations by Algorithm [6} we conclude that T2 < Ti. D 

Now consider a sequential task assignment in a 2D-HMS represented by 
A-(HMTjj) of skew shape (see Fig. [Wc)). A traditional task relocation mech- 
anism [3j allows the submesh to slide up and become a new task reassignment 
(see Fig. l7](d)). Note that a simple sliding mechanism does not often result 
in the task reassignment having the form of a standard hierarchical 2D mesh 
tableau. Rather, a descent pair may occur and become a generalized hi- 
erarchical 2D mesh tableau. Recall that the task relocation mechanism in 
Algorithm [7] results in the task reassignment represented by a hierarchical 
2D mesh tableau of normal shape (see Fig. [We)). For example, let Aq be an 
initial task assignment represented by A-(IIMTjj) of skew shape in Fig. [Wc). 
Since Aq has four empty processors in the top-left corner of A-(HMTj ,,■), the 
task reassignment sequence is (Aq, Ai, y42, ^3, ^4) by Algorithm [7J In the 
process of task reassignments, the greedy task relocation policy keeps a hi- 
erarchical 2D mesh tableau from occurring any descent pair. It follows that 
the resulting task reassignment ^4 in Fig. [We) is a necessarily optimal task 
reassignment for Aq in Fig. U{c). 
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7. Conclusions 

In this paper we liave presented a novel approach to representing priority- 
based task reassignments in a hierarchical 2D mesh-connected system (2D- 
HMS). A priority-based task reassignment in a 2D-HMS is represented by 
a hierarchical 2D mesh tableau A-(HMTj .,•) in which task relocations under 
the greedy task relocation policy are reduced to a jeu de taquin slide on A- 
(HMTij). In case we ignore task relocation costs, the hierarchical nature of 
A-(HMTj j) allows each task relocation to improve the task turnaround time. 
Some comparisons are given between the traditional approach and our ap- 
proach to task relocations in a 2D-HMS. For a priority-based task assignment 
represented by A-(IIMTjj), a traditional approach may generate a descent 
pair for task relocations, which means that it is not a necessarily optimal 
task reassignment. We have shown that our approach to task reassignment 
keeps A-(HMTj j) from occurring any descent pair for task relocations. 

To the best of our knowledge, this paper is the first attempt to study 
task assignments and reassignments in 2D mesh-connected systems by using 
tableaux. However, the assumptions that we have made about priority-based 
task reassignments in a 2D-HMS still need to be relaxed. For example, we 
assumed that communications and task relocations are cost-free in a 2D- 
HMS. Priority-based task reassignments in a 2D-HMS with more relaxed 
assumptions (e.g., non-zero task relocation costs) are left for future work. 
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